Application of Pronominal Divergence and Anaphora Resolution in English-Hindi Machine Translation

Authors

  • Kamlesh Dutta
  • Nupur Prakash
  • Saroj Kaushik
Abstract

So far, the majority of Machine Translation (MT) research has focused on translation at the level of individual sentences. For sentence-level translation, MT research has addressed various divergence issues for a large variety of languages; the issue of pronominal divergence, however, has been presented only recently. Since the translation quality that users require depends on coherent multi-sentence discourse structure in a specific context, pronominal divergence helps us understand the nuances of translation that arise from disparities between the languages. Using clues from this divergence, an anaphora resolution system can then find the correct interpretation of the given pronominal referents and other entities by resolving the inter-sentential context. In the literature, researchers have examined the issue and have proposed ways to classify and resolve anaphora. For Indic languages, however, few studies are available. In this paper, we discuss different aspects of pronominal divergence that affect anaphora resolution in English-Hindi Machine Translation (EHMT). The study should be helpful in developing approaches that explicitly use inter-sentential information to resolve specific types of ambiguity and that generate coherent multi-sentence discourse structure in the target language, producing higher-quality machine translation.

Similar resources

Exploring Semantic Information from Hindi Dependency Treebank for Resolving Pronominal Anaphora

Anaphora resolution is an exigent task in almost all NLP applications, such as text summarization, machine translation, information extraction, question-answering systems, etc. A lot of work has been done on identifying, and still more needs to be done on finding, the factors responsible for resolving anaphora in all languages. An attempt has been made to resolve Hindi pronominal anaphora using...

Proposal of an English-Spanish Interlingual Mechanism Focused on Pronominal Anaphora Resolution and Generation in Machine Translation Systems

In this paper, an interlingual mechanism oriented to pronominal reference resolution and generation in Machine Translation (MT) systems is proposed. This mechanism is based on the Slot Structure (SS) presented in [3] [2]. A comparison of pronominal reference resolution in English and in Spanish is developed to study the existing discrepancies between the two languages. From this s...

Machine Learning Approach for Resolving Pronominal Anaphora Using Hindi Dependency Treebank

Machine learning enables computers to mimic human intelligence by applying a set of rules to massive amounts of trained data and identifying patterns to make decisions and adapt based on what patterns are still uncovered. A number of applications, ranging from spam detection, facial recognition, and product recommendations to credit-card fraud detection, all apply machine learning pr...

Modelling pronominal anaphora in statistical machine translation

Current Statistical Machine Translation (SMT) systems translate texts sentence by sentence without considering any cross-sentential context. Assuming independence between sentences makes it difficult to make certain translation decisions when the necessary information cannot be determined locally. We argue for the necessity to include cross-sentence dependencies in SMT. As a case in point, we st...

Pronominal Reference Type Identification and Event Anaphora Resolution for Hindi

In this paper, we present hybrid approaches for pronominal reference type (abstract or concrete) identification and event anaphora resolution for Hindi. Pronominal reference type identification is one of the important parts of any anaphora resolution system, as it helps the anaphora resolver with optimal feature selection based on pronominal reference types. We use language specific rules and feature...

Journal:
  • Polibits

Volume 39

Publication date: 2009